T 



ID) 



. RAW SEQUENCE LISTING DATE: 04/16/2002 

PATENT APPLICATION: US/10/076,157 TIME: 14:33:26 

Input Set : A:\EP.txt 

Output Set: N:\CRF3\04162002\J076157.raw 

3 <110> APPLICANT: Pompejus, Markus 

4 Suelberger, Harald 

5 Joeffken, Hans Wolfgang 

6 Doval, Jose Luis Revuelta 

7 Jimenez , Alberto ; 

8 Garcia, Maria Angeles Santos 

10 <120> TITLE OF INVENTION: Genes of purine biosynthesis from Ashbya Gossypii 

use thereof 

11 in 

12 microbial riboflavin synthesis 
14 <130> FILE REFERENCE: 48684DIV 

16 <140> CURRENT APPLICATION NUMBER: US 10/076,157 

17 <141> CURRENT FILING DATE: 2002-02-15 

19 <150> PRIOR APPLICATION NUMBER: US 09/212,247 

20 <151> PRIOR FILING DATE: 1998-12-16 
22 <160> NUMBER OF SEQ ID NOS : 21 

24 <170> SOFTWARE: WordPerfect v. 6.1 

26 <210> SEQ ID NO: 1 

27 <211> LENGTH: 1911 

28 <212> TYPE: DNA 

29 <213> ORGANISM: Ashbya gosypii 

31 <220> FEATURE: 

32 <221> NAME/KEY: CDS 

33 <222> LOCATION: 626.. 1582 

35 <400> SEQUENCE: 1 ^ ^ an 

37 ggtagtcgct catcgacaga cacaatcgcg tgttctctct gaatcgtcca ttgggtgtca 60 
39 gcatcctgat cgcgggcgga tggaatgggt aatcattagg aaacaccaat gtcccatggt 
41 attgtccgtc ctcgtatggt gtctcaggag gacccgtgat cacgtagtgc cacaccagga 
43 tattgtcttc ctttggtgct gccacgatgt agggcggggg gttctcggtc atcattttgt 
45 actcctttga gagccgcttg tacgcctgtc ttgatgccat cttgcctact attagtttct 
47 caccacttcc cgccaaacaa tctgcacttt acgagcgcta tctatccctc g^f c^^tct 
49 agttgattat tggcgaaact gatagttcag gtacttccat gatgcggtca tatccacgta 
51 tgtgatcacg tgatcatcag ccatgctgcc agctcacggg cctgcctaca ctattggagg 
53 ctctgtgagt catgatttat tgcatatcaa gcccagatag tcgttgggga tactaccgtt 
55 gccgcgatga gctccgatat taagttgtag ccaaaaattt taacggatga cttcttaaca 
^ ^ . ^ ^4-^ 4.^^ -i-i-i/i aji-h arro fit a aaa ctQ cta 



120 
180 
240 
300 
360 
420 
480 
540 
600 
652 



ij!) gccgcgai.gd y uu^j^jycxuau ^ - 

57 gttattgacg ccgcaatcct acgcc atg teg tec aat age ata aag ctg cta 
5Q ^ ^ ^ Met Ser Ser Asn Ser He Lys Leu Leu 

59 ^ ^ inn 

61 gca ggt aac teg cac ccg gac cta get gag aag gtc tec gtt cgc cta 700 

62 Ala Gly Asn Ser His Pro Asp Leu Ala Glu Lys Val Ser Val Arg Leu 

63 10 15 

65 ggt gta cca ctt teg aag att gga gtg tat cac tac tct aac aaa gag 748 

66 Gly val Pro Leu Ser Lys He Gly Val Tyr His Tyr Ser Asn Lys Glu 

67 30 35 40 
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69 acg tea gtt act ate ggc gaa agt ate egt gat gaa gat gtc tac ate 796 

70 Thr ser Val Thr He Gly Glu Ser He Arg Asp Glu Asp Val Tyr He 

cn SS 

71 4D 

73 ate cag aca gga acg ggg gag eag gaa ate aac gae ttc etc atg gaa 

74 He Gin Thr Gly Thr Gly Glu Gin Glu He Asn Asp Phe Leu Met Glu 

75 60 65 70 

77 ctg etc ate atg ate cat gee tge egg tea gee tct gcg egg aag ate 

78 Leu Leu He Met He His Ala Cys Arg Ser Ala Ser Ala Arg Lys He 

79 75 80 85 

81 aca gcg gtt ata eca aac ttc cct tac gca aga caa gae aaa aag gac 

82 Thr Ala Val He Pro Asn Phe Pro Tyr Ala Arg Gin Asp Lys Lys Asp 

83 90 95 100 105 

85 aag teg cga gca ccg ata act gee aag ctg gtg gcc aag atg eta gag 

86 Lys Ser Arg Ala Pro He Thr Ala Lys Leu Val Ala Lys Met Leu Glu 

87 110 115 120 



89 acc gcg ggg tge aac cac gtt ate acg atg gat ttg cac gcg tct caa 

90 Thr Ala Gly Cys Asn His Val He Thr Met Asp Leu His Ala Ser Gin 



91 125 



130 135 



844 



892 



940 



988 



1036 



1084 



1132 



1180 



1228 



93 att cag ggt ttc ttc cac att eca gtg gac aac eta tat gca gag ccg 

94 He Gin Gly Phe Phe His He Pro Val Asp Asn Leu Tyr Ala Glu Pro 

95 140 145 150 

97 aac ate ctg cac tac ate caa eat aat gtg gac ttc cag aat agt atg 

98 Asn He Leu His Tyr He Gin His Asn Val Asp Phe Gin Asn Ser Met 

99 155 160 165 

101 ttg gtc gcg eca gac gcg ggg teg gcg aag cge acg teg acg ctt teg 

102 Leu val Ala Pro Asp Ala Gly Ser Ala Lys Arg Thr Ser Thr Leu Ser 

103 170 175 180 185 

105 gae aag ctg aat etc aac ttc gcg ttg ate cac aaa gaa egg cag aag 

106 ASP Lys Leu Asn Leu Asn Phe Ala Leu He His Lys Glu Arg Gin Lys 

107 190 195 200 

109 geg aac gag gtc teg egg atg gtg ttg gtg ggt gat gtc gcc gac aag 1276 

110 Ala Asn Glu val Ser Arg Met Val Leu Val Gly Asp Val Ala Asp Lys 

111 205 210 215 

113 tec tgt att att gta gac gac atg geg gae acg tge gga acg eta gtg 

114 ser Cys He He Val Asp Asp Met Ala Asp Thr Cys Gly Thr Leu Val 

115 220 225 230 

117 aag gcc act gac acg ctg ate gaa aat tgt geg aaa gaa gtg att gcc 1372 

118 Lys Ala Thr Asp Thr Leu He Glu Asn Cys Ala Lys Glu Val He Ala 

119 235 240 245 

121 att gtg aca cac ggt ata ttt tct ggc ggc gcc cge gag aag ttg cge 

122 He Val Thr His Gly He Phe Ser Gly Gly Ala Arg Glu Lys Leu Arg 

123 250 255 260 265 

125 aac age aag ctg gca egg ate gta age aca aat acg gtg eca gtg gac 

126 Asn Ser Lys Leu Ala Arg He Val Ser Thr Asn Thr Val Pro Val Asp 

127 270 275 280 

129 etc aat eta gat ate tac cac caa att gae att agt gee att ttg gcc 1516 

130 Leu Asn Leu Asp He Tyr His Gin He Asp He Ser Ala He Leu Ala 

131 285 290 295 

133 gag gca att aga agg ctt cac aac ggg gaa agt gtg tog tac ctg ttc 1564 



1324 



1420 



1468 
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Input Set : As\EP.txt 
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134 Glu Ala He Arg Arg Leu His Asn Gly Glu Ser Val Ser Tyr Leu Phe 

135 300 305 310 

137 aat aac get gtc atg tagtgctgtc agtggcagat gcatgatcgc tggcctaatt 1619 

138 Asn Asn Ala Val Met 

139 315 

141 atctgtgtaa gttgatacaa tgcagtaaat acagtacata aaactgaatg tttttcactt 1679 
143 aggggtgctt tgttgttctg atagcgtgtg tgcgaatttg gaggtgaaag ttgaacatca 1739 
145 cgtaatgaat acaaacaaga ttgcacatta ggaaaagcga taaattattt attatttgca 1799 
147 actggccttt gagcgtttaa gcctgaacat ttttgccctt ttgtttgacc gtaccgttat 1859 
149 cactcgtcct tatatatggc tatccttctc ttccggaact tcttcgagcg ta 1911 

154 <210> SEQ ID NO: 2 

155 <211> LENGTH: 318 

156 <212> TYPE: PRT 

157 <213> ORGANISM: Ashbya gosypii 
159 <400> SEQUENCE: 2 

161 Met Ser Ser Asn Ser He Lys Leu Leu Ala Gly Asn Ser His Pro Asp 

162 15 10 15 

164 Leu Ala Glu Lys Val Ser Val Arg Leu Gly Val Pro Leu Ser Lys He 

165 20 25 30 

167 Gly Val Tyr His Tyr Ser Asn Lys Glu Thr Ser Val Thr He Gly Glu 

168 35 40 45 

170 Ser He Arg Asp Glu Asp Val Tyr He He Gin Thr Gly Thr Gly Glu 

171 50 55 60 

173 Gin Glu He Asn Asp Phe Leu Met Glu Leu Leu He Met He His Ala 

174 65 70 75 80 

176 Cys Arg Ser Ala Ser Ala Arg Lys He Thr Ala Val He Pro Asn Phe 

177 85 90 95 

179 Pro Tyr Ala Arg Gin Asp Lys Lys Asp Lys Ser Arg Ala Pro He Thr 

180 100 105 110 

182 Ala Lys Leu Val Ala Lys Met Leu Glu Thr Ala Gly Cys Asn His Val 

183 115 120 125 

185 He Thr Met Asp Leu His Ala Ser Gin He Gin Gly Phe Phe His He 

186 130 135 140 

188 Pro Val Asp Asn Leu Tyr Ala Glu Pro Asn He Leu His Tyr He Gin 

189 145 150 155 160 

191 His Asn val Asp Phe Gin Asn Ser Met Leu Val Ala Pro Asp Ala Gly 

192 165 170 175 

194 Ser Ala Lys Arg Thr Ser Thr Leu Ser Asp Lys Leu Asn Leu Asn Phe 

195 180 185 190 

197 Ala Leu He His Lys Glu Arg Gin Lys Ala Asn Glu Val Ser Arg Met 

198 195 200 205 

200 Val Leu Val Gly Asp Val Ala Asp Lys Ser Cys He He Val Asp Asp 

201 210 215 220 

203 Met Ala Asp Thr Cys Gly Thr Leu Val Lys Ala Thr Asp Thr Leu He 

204 225 230 235 240 

206 Glu Asn Cys Ala Lys Glu Val He Ala He Val Thr His Gly He Phe 

207 245 250 255 

209 Ser Gly Gly Ala Arg Glu Lys Leu Arg Asn Ser Lys Leu Ala Arg He 

210 260 265 270 
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SEQ ID NO 


3 




LENGTH: 5369 




TYPE: DNA 






ORGANISM: 


Ashbya gossypii 


FEATURE : 






NAME/KEY : 


CDS 




LOCATION: 


55. . 


1482 


FEATURE : 






NAME/KEY : 


CDS 




LOCATION: 


1767 


. .3299 


FEATURE : 






NAME/KEY : 


CDS 




LOCATION: 


3588 


. .4703 


SEQUENCE : 


3 





RAW SEQUENCE LISTING DATE: 04/16/2002 

PATENT APPLICATION: US/10/076,157 TIME: 14:33:26 

Input Set : A:\EP.txt 

Output Set: N:\CRF3\04I62002\J076157.raw 

212 Val Ser Thr Asn Thr Val Pro Val Asp Leu Asn Leu Asp He Tyr His 

213 275 280 285 

215 Gin He Asp He Ser Ala He Leu Ala Glu Ala He Arg Arg Leu His 

216 290 295 300 

218 Asn Gly Glu Ser Val Ser Tyr Leu Phe Asn Asn Ala Val Met 

219 305 310 315 

223 <210> 

224 <211> 

225 <212> 

226 <213> 

228 <220> 

229 <221> 

230 <222> 

232 <220> 

233 <221> 

234 <222> 

236 <220> 

237 <221> 

238 <222> 
240 <400> 

242 aagcttgacc ttggctggca cttgagtcgg cagacaggtg gactaacccg agca atg 57 

243 Met 

244 ^ 
24 6 gat cgt ggt tgt aaa ggt ate tct tat gtg etc agt gca atg gtt ttt 

247 Asp Arg Gly Cys Lys Gly He Ser Tyr Val Leu Ser Ala Met Val Phe 

248 5 10 • 15 

250 cac ata ata ccg att aea ttt gaa ata teg atg gta tgt ggc ata ttg 153 

251 His He He Pro He Thr Phe Glu He Ser Met Val Cys Gly He Leu 

252 20 25 30 

254 aca tac eag ttt ggt get tee ttc get get ata aca tte teg aet atg 

255 Thr Tyr Gin Phe Gly Ala Ser Phe Ala Ala He Thr Phe Ser Thr Met 

256 35 40 45 

258 ett ett tae tee ate ttt act ttc aga aeg aeg gcg tgg cge aea egg 249 

259 Leu Leu Tyr Ser He Phe Thr Phe Arg Thr Thr Ala Trp Arg Thr Arg 

260 50 55 60 65 

262 ttt agg egt gat gcg aae aag get gac aat aag gee get agt gtg gca 297 

263 Phe Arg Arg Asp Ala Asn Lys Ala Asp Asn Lys Ala Ala Ser Val Ala 

264 70 75 80 

266 ttg gat tec eta ata aat ttt gaa get gta aag tat ttc aat aac gag 345 

267 Leu Asp Ser Leu He Asn Phe Glu Ala Val Lys Tyr Phe Asn Asn Glu 

268 85 90 95 

270 aag tac ett gcg gac aag tat cac aea tec ttg atg aag tac egg gat 

271 Lys Tyr Leu Ala Asp Lys Tyr His Thr Ser Leu Met Lys Tyr Arg Asp 

272 100 105 110 

274 tec eag ata aag gtc teg caa teg ctg gcg ttt ttg aac ace ggc eag 441 

275 Ser Gin He Lys Val Ser Gin Ser Leu Ala Phe Leu Asn Thr Gly Gin 

276 115 120 125 

278 aac eta att ttt ace act gca ctg aet gca atg atg tat atg gee tgt 

279 Asn Leu He Phe Thr Thr Ala Leu Thr Ala Met Met Tyr Met Ala Cys 



105 



201 



393 



489 



file://C:\Crf3\Outhold\VsrJ076157.htm 



4/16/02 



Page 5 of 7 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/076,- 157 



DATE: 04/16/2002 
TIME: 14:33:26 



Input Set : A:\EP.txt 

Output Set: N:\CRF3\04162002\J076157.raw 



280 
282 
283 
284 
286 
287 
288 
290 
291 
292 
294 
295 
296 
298 
299 
300 
302 
303 
304 
306 
307 
308 
310 
311 
312 
314 
315 
316 
318 
319 
320 
322 
323 
324 
326 
327 
328 
330 
331 
332 
334 
335 
336 
338 
339 
340 
342 
343 
344 



130 

aat ggt 
Asn Gly 

aat caa 
Asn Gin 

gtc tac 
Val Tyr 



aaa 
Lys 

aac 
Asn 
210 
ttt 
Phe 



ctg 
Leu 
195 
eta 
Leu 

ggc 
Gly 



ate oca 
lie Pro 

aag tec 
Lys Ser 

ggt cgt 

Gly Arg 
275 
tct tta 
Ser Leu 
290 

aat gac 
Asn Asp 

gac gat 
Asp Asp 

etc cag 
Leu Gin 

ttg atg 
Leu Met 
355 
ett ttg 
Leu Leu 
370 

ctg gat 
Leu Asp 



gtt atg 
Val Met 

ctg gta 
Leu Val 
165 
cgt gat 
Arg Asp 
180 

caa aaa 
Gin Lys 

cca ata 
Pro lie 

tat gac 
Tyr Asp 

get gga 
Ala Gly 
245 
acc att 
Thr lie 
260 

ate eta 
lie Leu 

egg aag 
Arg Lys 



aca ate 
Thr lie 



gag att 
Glu He 
325 
aac eta 
Asn Leu 
340 

ate age 
He Ser 

aag gac 
Lys Asp 

aca cae 
Thr His 



135 
cag ggc 
Gin Gly 
150 

ttc cag 
Phe Gin 

etc aag 
Leu Lys 

aat cag 
Asn Gin 

cac aaa 
His Lys 
215 
ccg gag 
Pro Glu 
230 

atg aag 
Met Lys 

ttg aag 
Leu Lys 

gtt ggc 
Val Gly 

get ate 
Ala He 
295 
tgg gag 
Trp Glu 
310 

etc agg 
Leu Arg 



tct ett 
Ser Leu 



etc tec 
Leu Ser 

cag tct 
Gin Ser 
185 
gtc aca 
Val Thr 
200 

ccg ttg 
Pro Leu 

egg cgt 
Arg Arg 

act gee 
Thr Ala 

etc gta 
Leu Val 
265 
ggc aca 
Gly Thr 
280 

ggt gtc 
Gly Val 

aat gtt 
Asn Val 

gee ata 
Ala He 



aca 

Thr 

gtg 
Val 
170 
ctg 
Leu 



gtg 

Vcl J. 

155 
cca 
Pro 

ata 
He 



140 

ggg gat 
Gly Asp 

eta aac 
Leu Asn 

gat atg 
Asp Met 



att aag 

He Lys 

gat att 
Asp He 

ata ttg 
He Leu 
235 
ata gta 
He Val 
250 

ttt aga 
Phe Arg 

gat ate 
Asp He 

gtg ccc 
Val Pro 



aac 
Asn 

cgc 
Arg 
220 
aac 
Asn 



tec 
Ser 
205 
ttt 
Phe 

aat 
Asn 



ett gtg tta 

T 17a 1 T.otl 

160 

ttc ett ggt 
Phe Leu Gly 

175 
gaa tct tta 
Glu Ser Leu 
190 

cca aat gee 
Pro Asn Ala 



145 
att 
He 

age 
Ser 

ttt 
Phe 

cag 
Gin 



cca aag 
Pro Lys 

gga ggt 
Gly Gly 

get ccg 
Ala Pro 
375 
aca gag 
Thr Glu 
390 



ggc get 
Gly Ala 
345 
gag aaa 
Glu Lys 
360 

ctg atg 
Leu Met 

cag gea 
Gin Ala 



aaa 
Lys 

gaa 
Glu 
330 
tec 
Ser 



ttc 
Phe 
315 
aaa 
Lys 

ace 
Thr 



caa agg 
Gin Arg 

ttt ttc 
Phe Phe 

etc ttg 
Leu Leu 
395 



ggc cca 
Gly Pro 

ttc tat 
Phe Tyr 

cgc gat 
Arg Asp 
285 
caa gat 
Gin Asp 
300 

ggc aat 
Gly Asn 

get caa 
Ala Gin 

gtt gta 
Val Val 

ett get 
Leu Ala 
365 
gac gag 
Asp Glu 
380 

cae ace 
His Thr 



gaa aat 
Glu Asn 

gtt teg 
Val Ser 

teg ggc 
Ser Gly 
255 
gag ccc 
Glu Pro 
270 

tta gac 
Leu Asp 



gtt aeg 
Val Thr 
225 
ttt ace 
Phe Thr 
240 

teg ggg 
Ser Gly 

gag caa 
Glu Gin 

ttg ett 
Leu Leu 



act ect etc 
Thr Pro Leu 

ate agt tee 
He Ser Ser 
320 

etc aeg aag 
Leu Thr Lys 

335 
ggg gag cgc 
Gly Glu Arg 
350 

att get cgt 
He Ala Arg 



ttc 
Phe 
305 
tct 
Ser 

eta 
Leu 

ggt 
Gly 

gtg 
Val 



get aca agt get 
Ala Thr Ser Ala 
385 

att cag cag aac 
He Gin Gin Asn 
400 



537 



585 



633 



681 



729 



777 



825 



873 



921 



969 



1017 



1065 



1113 



1161 



1209 



1257 
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VERIFICATION SUMMARY DATE: 04/16/2002 

PATENT APPLICATION: US/10/076,157 TIME: 14; 33: 28 

Input Set : A:\EP.txt 

Output Set: N:\CRF3\04162002\J076157.raw 
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